Temporal control and training selection for HMM-based system

نویسندگان

  • Claude Barras
  • Marie-José Caraty
  • Claude Montacié
چکیده

Most speaker-independent acoustic-phonetic decoding systems are based on hidden Markov models. Such systems lack a real temporal control for the phonetic models. Furthermore, inter-speaker variability makes speaker adaptation necessary. In order to solve these problems, we introduce two original approaches. On the one hand, discontinuities detected with the ForwardBackward Divergence method are used to constrain phonetic transitions and to perform a more accurate temporal control. On the other hand, an efficient interspeaker measure, based on AR-vector models, allows the selection of a speaker neighbourhood and the adaptation of the phonetic models. The contribution of these two methods is estimated on the TIMIT database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Fast and Efficient HMM-Based Face Recognition System Using a 7-State HMM Along With SVD Coefficients

In this paper, a new Hidden Markov Model (HMM)-based face recognition system is proposed. As a novel point despite of five-state HMM used in pervious researches, we used 7-state HMM to cover more details. Indeed we add two new face regions, eyebrows and chin, to the model. As another novel point, we used a small number of quantized Singular Values Decomposition (SVD) coefficients as feature...

متن کامل

MAN-MACHINE INTERACTION SYSTEM FOR SUBJECT INDEPENDENT SIGN LANGUAGE RECOGNITION USING FUZZY HIDDEN MARKOV MODEL

Sign language recognition has spawned more and more interest in human–computer interaction society. The major challenge that SLR recognition faces now is developing methods that will scale well with increasing vocabulary size with a limited set of training data for the signer independent application. The automatic SLR based on hidden Markov models (HMMs) is very sensitive to gesture's shape inf...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Mixture splitting technic and temporal control in a HMM-based recognition system

In this paper, we study various technics to improve the performance, to reduce the computation cost and the required memory of a recognition system based on HMM. For the efficiency of the system, we first study the optimization of the number of HMM parameters according to training data. We experiment a temporal control of the phonetic transitions on lexical decoding task with a significant 5% i...

متن کامل

A Bayesian Approach to Temporal Data Clustering using Hidden Markov Models

This paper presents clustering techniques that partition temporal data into homogeneous groups, and constructs state based proles for each group in the hidden Markov model (HMM) framework. We propose a Bayesian HMM clustering methodology that improves upon existing HMM clustering by incorporating HMM model size selection into clustering control structure to derive better cluster models and part...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995